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HIGHLIGHTS 

• This interview study explores beliefs about the instructional role of illustrations 

• We compared illustrators', teachers', students' and common people's ideas 

• Participants' responses were internally coherent and close to multimedia learning theory 

• We propose and discuss an integrated multimedia learning model 

An interview study, based on specific pictures taken from textbooks used in primary 
schools, was carried out to investigate illustrators', teachers', students', and common 
people's beliefs about the role that illustrations play in facilitating learning. Participants' 
responses were internally coherent, indicating a systematic nature of the underlying naive 
conceptions. Findings disprove Mayer's pessimistic claim that laypersons' conceptions of 
multimedia learning fail to match experimentally supported principles and theories. On 
the contrary, interviewees spontaneously came very close to the multimedia learning 
theory, which states that students learn better from pictures, which fit specific cognitive 
principles. Implications for school instruction are highlighted. 
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INTRODUCTION 

The purpose of the present study is to investigate people's naive 
conceptions about the mental processes supporting the efficacy 
of multimedia artifacts to be employed in instructional settings. 
To explain the rationale underlying the research project, the con- 
cept of multimedia learning will first be described. Then, the 
issue of the folk conceptions about instructional devices will be 
addressed. The third part of the Introduction is aimed at link- 
ing such conceptions to multimedia learning. Finally, we will 
present the specific perspective concerning naive conceptions 
about multimedia learning on which our study is based. 

MULTIMEDIA LEARNING 

The cognitive theory of multimedia learning, proposed by Mayer 
(2001, 2005), can be assumed as a basis to support the bene- 
fits produced by multimedia in school instruction. Such a theory 
has its root in the Multiple Representation Principle (Mayer and 
Anderson, 1992), which states that it is better to present an expla- 
nation in words and pictures than solely in words, since two 
modes of representation allow learners to extract information 
from two different sources (so that, if learners are inefficient in 
managing one of them, they can rely on the other) and to store 
it in two distinct systems (so that, if the verbal track is disrupted, 
the visual one can keep information in memory and vice versa). 

Mayer identified seven specific principles aimed at explain- 
ing how and why multimedia learning is actually effective. 
The spatial contiguity principle states that students learn bet- 
ter when corresponding words and pictures are presented close 
to, rather than far from, each other on the page or on the 
screen. Similarly, the temporal contiguity principle states that 
learning is enhanced when corresponding words and pictures 



are presented simultaneously rather than successively. The coher- 
ence principle claims that students learn better when extraneous 
visual material is excluded rather than included. The modality 
principle states that students learn better from animation and 
narration than from animation and on-screen text. The redun- 
dancy principle asserts that students learn better from animation 
and narration than from animation, narration, and text. Mayer 
further hypothesized an individual difference principle, according 
to which multimedia effects are stronger for low-knowledge than 
for high-knowledge learners and for high-spatial rather than for 
low-spatial learners. 

Schnotz (2005) proposed new principles to integrate those 
proposed by Mayer: the picture-text sequencing principle, which 
postulates that, if a written text and a picture cannot be presented 
simultaneously, it is better to present the picture before the text 
than the other way round; the structure-mapping principle, stat- 
ing that, if a subject matter can be visualized by different pictures 
in different ways that are informationally equivalent, it is better 
to use a picture with the form of visualization that is more appro- 
priate to solve future tasks; the general redundancy principle, that 
suggests not combining texts and pictures if learners have suffi- 
cient prior knowledge and cognitive abilities to construct a mental 
model from one source of information; the control-of -processing 
principle, supporting the claim that, if a static picture is combined 
with text, the text is difficult to understand, and learning time is 
not limited, it is better to use written text rather than spoken text. 

NAIVE CONCEPTIONS ABOUT INSTRUCTIONAL TOOLS 

The features of the instructional devices are important, but we 
have to keep in mind that, in order to implement multimedia 
learning efficiently, attention has to be paid also to teachers' 
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and students' beliefs about the cognitive processes activated by 
the educational tools to be employed (Antonietti and Colombo, 
2008). Such beliefs — as noted by Veenman et al. (2006) — can be 
either correct or wrong. For instance, a teacher might assume 
that pupils can understand some subjects better if verbal descrip- 
tions are accompanied with multi-cultured and realistic photos 
which can increase the learners' curiosity and interest and so 
he/she might choose a textbook including such kind of visual 
materials, which actually fail to match Mayer's coherence princi- 
ple. As another example, a student who has to learn a textbook 
page where words and graphics are simultaneously displayed 
in an interconnected way might think that it is preferable to 
read the whole set of sentences reported on the page first and 
then to look at the graphics, instead of shifting attention repeat- 
edly from each sentence to the corresponding graphic (a process 
which that learner assumes to be too demanding), thus miss- 
ing the advantages coming from the contiguity principle. Hence, 
it seems to be crucial that trainers and trainees share relevant 
beliefs about what is the optimal way to manage educational mul- 
timedia presentations (Kluwe, 1982; Brown, 1987; Nelson et al., 
1998). 

It is well-recognized that people's opinions about the instruc- 
tional tools are not isolated, idiosyncratic beliefs, but rather 
they are usually coherently organized in well-defined concep- 
tions which can influence the entire learning process, especially 
the standards that are set up as goals to be achieved, as well as 
the cognitive operations to be carried out to perform the task at 
hand (Braten and Stromso, 2006; Bromme et al., 2010; Efklides, 
2011). Such conceptions in fact act as an "apprehension struc- 
ture," which anticipates the task to be executed and helps learners 
to adapt themselves to the perceived characteristics of the task 
by metacognitively calibrating to them (Wilson and Bai, 2010). 
Various studies have shown that students with adequate beliefs 
about the instructional tools to be employed and the underlying 
mental processes usually achieve better learning results (Veenman 
et al, 1994, 2002; Antonietti et al, 2000; Crosier et al., 2000; 
Richardson et al., 2002; Strieker et al, 2004; Aydin and Ubuz, 
2010; Fayena-Tawil et al, 2011). 

NAIVE CONCEPTIONS OF MULTIMEDIA LEARNING 

Students appear to be able to conceptualize benefits of using 
successful instructional technology which is supported by an 
adequate pedagogical approach (Mouza and Bell, 2001), such 
as when technology is implemented according to collaborative 
learning principles (Wilson and Whitelock, 1998). Students can 
also judge the quality of instructional materials not by superfi- 
cial features, but by cognitive aspects. For example, students are 
able to judge the level of interaction and the amount of feedback 
offered when working online (Hackman and Walker, 1995). They 
can perceive, in a Web based training environment, clear and con- 
sistent differences between the best courses and the worst, basing 
their evaluation mostly on both fun and test scores (Hassett et al., 
2003). 

As multimedia devices are more specifically concerned, it has 
been shown (Antonietti and Giorgetti, 2004, 2006; Antonietti 
et al., 2008; Jacobson et al, 2010) that students and teachers 
are able to identify a large number of non-trivial opportu- 
nities from multimedia learning and possess well-defined and 



internally-articulated beliefs about the cognitive processes which 
are involved. 

Naive conceptions about learning in multimedia environments 
appears to influence students' performance. Liu et al. (2004) 
reported that learning outcomes are linked to students' beliefs 
about the distinctive features of educational tools. The efficiency 
of the learning process is associated with being aware of the actual 
potentialities of such tools, more precisely with the cognitive 
strategies that they allow students to apply to face the learn- 
ing tasks. Jones (2001) maintained that students' attitudes and 
beliefs can reinforce the strength of visual annotations for gaining 
new knowledge from auditory text, thus promoting a more thor- 
ough processing of the passages and the usefulness of interaction 
with annotations. Huet et al. (2011) described some relationships 
between student's beliefs about help-seeking devices included in 
an interactive Web-based multimedia device and actual perfor- 
mance. Finally, the opportunity to exert control over learning 
in multimedia environments is beneficial if students have a high 
level of self-regulatory abilities (Scheiter and Gerjets, 2007) and 
relevant beliefs about the skills which are involved (Greene et al., 
2010). 

THE COMMON SENSE THEORY OF MULTIMEDIA LEARNING 

Naive conceptions as those mentioned previously are close to the 
«common sense theory^ of multimedia, which refers to the 
beliefs that the general population (made up of non-experts in 
the specific field) shares concerning the properties, uses, advan- 
tages and disadvantages of multimedia (Mayer, 2001, 2005). The 
common sense theory, according to Mayer, should be opposite 
to his assumptions. Such a theory considers the human mind as 
a passive information elaboration system which relies on a sin- 
gle, unlimited processing channel. In Mayer's (2001) opinion, 
the common sense notion is that the main goal of multimedia 
instructional tools is to present information, so corresponding to 
an information-delivery theory. According to this view, if infor- 
mation is presented in the form of words, then presenting the 
same information in pictures adds nothing to learning. 

AIMS 

Mayer (2001) claimed that non-experts have inadequate con- 
ceptions regarding multimedia learning. Basing his conclusions 
on his survey concerning how multimedia devices (and, more 
precisely, combinations of texts and pictures in textbooks) are 
used, he inferred that his cognitive principles regarding multi- 
media learning are not shared by the general population (Mayer, 
1993). From his analysis of textbooks (Levin and Mayer, 1993), 
Mayer concluded that writers and illustrators do not maximize the 
power of graphics to enhance multimedia learning (Mayer, 2001). 
This can be due — Mayer argued — to an ineffective underlying 
naive theory, probably close to the informational-delivery theory. 
Moreover, Clark and Feldon (2005) explored people's questionable 
beliefs, especially concerning expectations of multimedia learn- 
ing, matching them with experimental findings. They showed 
that commonly held principles are not supported by research. 

Might it be that persons' naive conception, in contrast with 
Mayer's claims, do not actually diverge from his theory? Mayer 
never tested his conjectures about the common sense theory, 
focusing his research on the experimentally-tested validity of his 
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principles. The purpose of the present study is to address this 
topic. 

Starting from the assumption that, albeit at different levels, 
all people have at the least some preconceived notions regarding 
the motives which induce designers of multimedia educational 
tools to combine pictures and texts in a given way and that they 
are able to verbalize and justify such ideas, an interview study 
was designed. We can presume that nowadays people are largely 
exposed to a variety of multimedia artifacts, not only in their 
professional domain. Moreover they actually use many such arti- 
facts and, more importantly, they can select the preferred item 
within a category of multimedia tools, and thus it is likely that 
they base their choices on specific beliefs about the quality of the 
item. Finally, mass-media communication spreads the idea that 
we live in an "image society," so leading people to reflect upon the 
meaning of this message. 

The aim was to investigate the conceptions shared by different 
subsamples about the functions and the usefulness of differ- 
ent typologies of illustrations. In this framework we decided to 
explore multimedia products as intended by traditional literacy, 
that is, a product, which combines texts and pictures (Houston, 
1988). 

The main goal of this study, therefore, is to assess naive con- 
ceptions about multimedia learning. The term <?Cnaive;?> is used 
here not to say that these conceptions are intrinsically inadequate 
since they do not correspond to the specialists' claims. Beliefs 
about multimedia learning are labeled as <Cnai've3> since they 
are developed by people who do not have a specific competence 
about the cognitive mechanisms underlying the comprehension 
processes activated by text-picture combinations. Different lev- 
els of naivety can be identified. Laypersons who have no current 
experience with multimedia learning can be placed at the lowest 
level. These persons are not intentionally exposed to multime- 
dia presentations with training aims and so they are not usually 
induced to pay attention to the features of educational multime- 
dia instruments. Their beliefs about such instruments are based 
not on personal engagement with multimedia presentations as 
learners or instructors, but on the indirect knowledge that they 
can derive by looking at them (in bookshops, libraries, adver- 
tisements) or by observing other people (their own sons, for 
instance) using them. We can place students and teachers at an 
intermediate level. Both are using multimedia textbooks as learn- 
ing tools and their concrete experience with them can lead these 
categories of individuals to became aware of the potentialities (by 
realizing when, how, and why learning is facilitated) and limits (by 
perceiving the difficulties they encounter, for instance) of the dif- 
ferent kinds of text-picture combinations, so developing deeper 
beliefs about them. Illustrators can be placed at the highest level. 
They do not possess an explicit, scientifically-grounded expertise 
about the cognitive mechanisms underlying multimedia learning, 
but the job of devising pictures to be associated with texts should 
induce them to reflect about what kinds of images are relevant 
and what are not, where to locate the images, their optimal size, 
colors, and so on. 

The study attempted to investigate three main issues. 

1 . The features of naive conception about multimedia learning. 
As far as this issue is concerned, the aim is descriptive and so 



no specific hypothesis was built. As yet the lack of empirical 
data prevents us expecting what people actually believe about 
the mental processes they think are activated when some- 
one is exposed to a given combination of texts and pictures. 
Attention is focused on two specific aspects: 

a. The cognitive functions attributed to different kinds of pic- 
tures, namely, the beliefs about the role — expressed in 
terms of mental processes (e.g., enjoying, visualizing, con- 
necting, understanding notions) — played by pictures. 

b. The perceived utility of different pictures, that is, the beliefs 
about the match/mismatch between the actual and the 
optimal mental processes activated by pictures. 

2. The internal coherence of the above mentioned conception 
is investigated. Since literature supports the notion that even 
naive people have well-defined and stable beliefs about men- 
tal processes involved in learning tasks, we hypothesize that 
consistent associations among participants' responses given 
to different requests concerning the same issue and rela- 
tionships among different aspects of the conceptions will 
emerge. 

3. Differences among subsamples concerning the conception in 
question. We hypothesize both that higher levels of exper- 
tise lead people to identify the mental processes involved 
in multimedia learning in a more adequate way and that a 
specific expertise in the target field (the use of illustrations 
for educational purposes) influences how individuals repre- 
sent such processes, by stressing the aspects which are more 
linked to their profession (e.g., teachers should emphasize how 
understanding can be improved by pictures). 

METHODS 
THE INTERVIEW 

It was decided to use an interview, rather than a questionnaire 
(as in some previous studies), to investigate people's beliefs, as 
an interview allowed us to go into more detail on each single 
topic, without affecting the respondents' answers. Interviews pro- 
vide in-depth information about a particular research issue or 
question and allow us to analyse the responses using a method 
which provides us with a "big picture" that transcends any one 
single bit of data. This is possible since a qualitative research 
interview seeks to cover both a factual and a meaning level 
(Kvale, 1996), and this appeared to be particularly appropriate 
for our exploratory study. Questionnaires seemed to be less suit- 
able for our aims, since they may influence respondents' answers, 
as, by their nature, they organize the target topics a priori. 
Moreover, questionnaires only ask for agreement/disagreement 
on general statements. Since it is unlikely that people state their 
beliefs as general statements, it is easier for them to express such 
beliefs with reference to specific situations — as provided by our 
interview — rather than as preset, generalized statements. Thus, 
we tried to encourage people to declare their opinions freely 
and on specific and concrete cases. Furthermore, as the inter- 
view was based on specific pictures taken from actual textbooks, 
the interview format was deemed likely to secure more valid 
data because of the greater ecological validity of the employed 
materials. Considering again the fact that beliefs are often not 
available as explicit abstract conceptualizations, our interview did 
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not require participants to give only verbal answers, but also asked 
them to respond through practical actions, such as selecting pic- 
tures and placing them in such a way as to create an original 
product. 

Each participant was asked to answer a series of questions 
within the same interview, which was structured according to 
the criteria explained below. Materials used during the interview 
were: (1) a booklet, which included a selection of textbook pages; 
(2) the materials for the Perfect Book; (3) a filing record. 

Booklet 

To build his functional classification, Mayer (1993) consid- 
ered 6 sixth-grade science textbooks approved for adoption in 
California. To assess the exact typology of each image, Mayer 
referred to Levin's (1989) functional classification. In accor- 
dance with the author's cognitive theory, the category the pic- 
tures belong to may elicit different cognitive processes. More 
precisely: 

1. Decorative pictures: they are not directly relevant to the text 
and should not have any effect on any cognitive process. 

2. Representational pictures: they portray one element that is 
mentioned in the text and should influence the process of 
selection. 

3. Organizational pictures: they represent the relations among 
elements in the text and should influence the process of 
selection and organization. 

4. Explanative pictures: they explain how something (e.g., a sys- 
tem, a process, a mechanism) works and should influence the 
process of selection, organization, and integration. 

To build the research protocol we decided to focus on primary 
schools, where the use of illustrations to match text is greater 
(Colombo et al„ 2009), so as to be suitable for the still develop- 
ing skills of reading texts in children. Unlike Mayer (1993), we 
did not focus only on one grade or age group, but considered 
textbooks related to all primary school grades (that is, accord- 
ing to the Italian school system, from the first to the fifth grade). 
Moreover, we also decided to examine not a single discipline, but 
the full range of subjects introduced in primary school, consid- 
ering the fact that people might have subject-specific conceptions 
about the use of illustrations. 

To select the textbooks to analyse, reference was made to the 
official classification 1 of the books most frequently chosen by 
Italian teachers during the 2004/2005 school year. Data for this 
study has been collected during 2005/2006 academic year — so this 
information was the more appropriate to use for the book selec- 
tion. As in Italy it is the teacher of each individual school who 
actually chooses the texts to be employed for the different grades 
each year, such a list should in fact reflect the true preferences of 
Italian teachers. Textbooks for every single class of primary school 
were taken into account. For each textbook, the space devoted 



1 This classification was compiled by Italian Ministry of Education on the basis 
of the choices made by teachers and communicated by schools to the Ministry 
itself, which thus constitutes a valid source of this kind of data. 



respectively to pictures and texts was quantified, while all the 
illustrations were classified 2 . 

The analysis of pictures was carried out on the basis of Mayer's 
classification (1993) and, hence, the iconic material was divided 
into Decorative (D), Representational (R), Organizational (O), 
and Explanative (E) illustrations. During an initial pilot anal- 
ysis, however, it was felt that some categories needed to be 
specified further. Specifically, while examining Decorative and 
Representational pictures, differences were found within the same 
category. Thus, it was decided to split the Decorative category 
into two distinct subcategories: first-level (Dl), with no refer- 
ence at all to the text, and second-level (D2), where analogies 
or connections with the text are present, albeit weak. Examples 
of Dl illustrations include all those elements that decorate the 
page, such as page frames and borders which often fill the empty 
spaces. D2 illustrations, on the other hand, are slightly more 
appropriate images, such as the picture of children playing in 
the woods placed close to a text referring to photosynthesis. 
While this picture does not represent any part of the text, a 
weak reference to the general topic can be found. The distinction 
outlined for Representational pictures was as follows: first-level 
Representational pictures (Rl), which actually represent parts of 
the text, and second-level Representational pictures (R2), which 
represent additional elements, that is, elements that somehow 
enrich or integrate the text. An example of Rl pictures may be 
recognized in an illustrated tale: each picture representing a scene 
of the story is a first-level Representational picture, since it shows 
a part of the text. An R2 picture, on the other hand, is an illus- 
tration that replaces the text or that adds something to the text, 
such as the photo or drawing of different types of leaves, with 
a detailed caption explaining the differences among them, placed 
in close proximity to a text explaining vegetation, which mentions 
trees and leaves but does not provide details on the different types 
of leaves (see Table 1). 

The pilot categorization of the illustrations led us to devise 
a structured system of classification — based on the distinction 
among Dl, D2, Rl, R2, O, and E pictures — which included a clear 
operational definition of each kind of image. Such a definition 
was expressed in terms of conditional clauses (e.g., "if the pic- 
ture depicts an item mentioned in the corresponding text, then 
exclude Dl and D2 categories and consider the other categories") 
so as to produce a sort of decision tree which avoids ambiguity 



2 To better explain our methodology in this first phase of the research, it is 
important to note that in Italian primary schools there is a difference between 
the structure of the first two grades (first cycle, where a more integrated struc- 
ture of disciplines is planned) and the last three grades (second cycle, where, 
as a kind of transition to a more advanced grade, a distinction between dis- 
ciplinary subjects and reading activity is made). This distinction is reflected 
in the textbooks: whereas texts for the first two grades are often conceived 
as a whole (even though they are sometimes made up of several volumes, 
for teachers' and pupils' convenience of use), texts for the last three grades 
are divided into two different units: disciplinary subjects and reading activi- 
ties. The distinction is quite clear-cut, especially for the last two years when, 
to suit the increasing learning demands of the growing pupils, the different 
units are grouped into different textbooks, so that often books from different 
publishing houses are chosen for the two areas in the same class. For these 
classes, textbooks both for disciplinary subjects and for reading activities were 
examined in the survey. 
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Table 1 | Examples of pictures for each category (all illustrations refer 
to the same text which describes the photosynthesis). 



Types of Illustration 



Example 



Decorative I level 

Decorative Image-no reference 
to the text 

Decorative II level 

Decorative Image-analogies or 
connections with the text are 
present, albeit weak 



Representational I level 

Represent parts of the text 



Representational II level 

Represent additional elements 
that enrich or integrate the text 



Organizational 

Represent the relations among 
elements in the text 



Explanative 

Explain how something works 






and hence allows encoders to reach a unanimous judgment. It is 
worth noting that the classification system we devised is metacog- 
nitive in its nature: the different types of pictures were defined 
according to the mental process they involve, which is the basis of 
its function in learning (Table 2). 

After having devised the classification system, the actual cate- 
gorization took place. All pictures were coded by two previously 
trained judges and assigned to the corresponding categories. The 
agreement between the two judges was 100%. 

After analysing the textbooks, we devised the actual booklet 
to be used in the interview, which consisted of 36 pages, derived 
from all the textbooks analysed. The page selection criteria were: 



• Representativeness: 



3. For each topic, the number of pages devoted to it (geog- 
raphy, history, maths, arts, language, science) were roughly 
equivalent. 

4. For each grade, at least two examples of each category were 
included (Dl, D2, Rl, R2, O, and E). 

• Economy of space: pages distinguished by the simultaneous 
presence of pictures belonging to different categories 3 were 
favored. 

• Meaningfulness: pictures clearly consistent or inconsistent with 
Mayer's principle (specifically, the spatial contiguity princi- 
ple, since the other principles were not applicable to our 
specific material. Mayer's principles were devised for multime- 
dia computer-supported presentations and would hardly have 
been transferable as a whole to text pages) were inserted in the 
booklet. To determine whether an illustration was or was not 
consistent with Mayer's spatial contiguity principle, Mayer's 
definition was taken into account: the pictures selected by the 
researchers were analysed to see if they met all the conditions 
considered as distinctive points, such as the proximity of one 
or more illustrations to the corresponding test sections and 
the relevance of those illustrations to the text being referred 
to. If they met all the conditions mentioned in Mayer's princi- 
ple, they were considered to be consistent with the principle. If 
those points were not met, the picture was not considered to 
be consistent. The agreement between the two researchers who 
checked the coherence/incoherence with the contiguity princi- 
ple was 98%. Ambiguous cases were evaluated and solved by 
the judges. 

Overall, 95 categorized images in the booklet were reported and 
divided among functional categories: 10 Dl, 13 D2, 14 Rl, 18 R2, 
22 O, and 19 E pictures. The number of pictures for the differ- 
ent categories was not the same for each subcategory, as the total 
number of selected illustrations was proportional to the number 
of pictures in each category in the analysed textbooks. It is also 
important to note that Dl and D2 pictures and Rl and R2 pictures 
were each considered (for the total amount) as a single cate- 
gory. That is to say, we considered Decorative pictures as a whole 
(bringing the number of Decorative pictures to a total of 23) 
and Representational pictures as a whole (bringing the number 
of Representational pictures to a total of 32). The sequence of the 
selected pictures in the booklet was randomly set. 

It is important to highlight that all the pages selected to com- 
pose the booklet were always full pages, which were not related 
to any other pages, but were autonomous in their meaning. This 
selection had been made so as not to influence interviewees' 
performance or interpretation. 

Materials for the perfect book 

The materials were as follows: 



1 . For each grade, texts appearing in the first, second, and third 
place in the official classification of the textbooks approved for 
use in the Italian school system were considered. 

2. For each textbook type (single text for grades 1-3 and read- 
ing and disciplinary textbooks for grades 4-5), pages from 
textbooks for all grades were included. 



• Two white cards (size slightly above A4). They represented the 
hypothetical pages of an ideal book; 



A page with one illustration only was also obviously included. This was done 
to balance the spontaneous comparison among pictures in the same page with 
the evaluation of single illustrations. 
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Table 2 | Taxonomy of the categories of pictures according to their cognitive functions. 



DOM the picture allow tlic 
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DOM the picture allow ihc 
lea.: i .': to Jul ::ivc I le 
relationships between die 
represented item and other 
elements mentioned in the 
text? 



yes 



Does the picture allow the 
learner to understand how 
the system of elements 
mentioned in the text 
works? 



|_yjj 

m 



no 



LjJ in 
J ■ to 
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anything which is related 
the text? 




no 



0 Lric pet Li.* produce 

an impressive 
representation of the item 
mentioned in Ihc text m 
the learner' s trend? 



00 



ye s 



no 



Does the picture stress the 
relationships between the 
represented item and other 
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text in order to facilitate 
the execution of an 
exereiNC. 1 



oo 



ye s 




Docs the picture activate 
m the learner' % BU the 
scenario :l: *.e.\t? 




Docs the piciurc allow 
the learner to represent 

vluu.U 4tU:l:ui al 

elcnxrr.ts of the topic of 
the text? 




MN 






0 



• A brief didactic text relevant to primary school (telling about 
vegetation and photosynthesis). The text was composed by 
elaborating upon an actual text found in the examined text- 
books; 

• Fourteen illustrations belonging to different categories, specif- 
ically: 2 Dl, 2 D2, 2 Rl, 2 R2, 3 O, and 3 E pictures. 

The filing record 

It was composed of three parts. 

(a) A section for collecting general information about the partic- 
ipant (gender, age, occupation/study course). 

(b) Grid of picture analysis. The interviewee was presented 
with the booklet described above (see point 1). Each page 
had previously been categorized according to the crite- 
ria given. The interviewer, on his/her part, had a grid in 
which the different pictures for each page of the book- 
let were labeled by means of abbreviations and which 
provided space to record the interviewee's answers (see 
below). 



(c) The record of the most- and least-useful pictures and of the 
Perfect Book. At the end of the protocol there was room to 
record the answers to these last two requests (see below). 

SAMPLE 

One of the goals of the study was to identify the naive theories of 
people who are differently experienced with the text-illustrations 
relationship. As stated previously, the targets of the study were 
illustrators of children's literature, teachers, and students; that is, 
the kinds of persons who, respectively, create, choose, and use 
textbooks. They are experienced in the field of multimedia (with 
the basic meaning of artifacts where text and pictures are used 
together) but are not trained in the corresponding specific cog- 
nitive theories. A group of adults with no formal experience in 
multimedia use constituted the control subsample. 

Four sub-samples of participants were created, each one cor- 
responding to a different category of subjects. The interviewees 
were all volunteers who had been contacted personally by the 
researchers and asked about their interest in the research topic 
and their willingness to join the study. They were neither paid 
nor given any academic credits for their participation. 
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Illustrators (N = 16; men = 4, women = 12; age range = 

21- 62 years, mean age = 39.25 years, SD = 12.52): this subsam- 
ple included people with work experience in the development of 
multimedia products. The majority of the participants in this sub- 
sample were illustrators specializing in (text) books for children. 
Graphic designers (who however had a basic training in the illus- 
tration field), who generally possess a broader experience in the 
field of multimedia products (such as Web-sites or didactic soft- 
ware), were annexed to the group to balance the sample, which 
would otherwise have been comprised solely of illustrators used to 
working mainly with paper-and-pencil tools and only occasion- 
ally with technological tools. The illustrators, listed in the annual 
of the members of the Italian National Illustrator Society, were 
recruited by means of a letter in which they were asked if they 
would be interested in joining the investigation. 

Teachers (N = 18; men = 4, women = 14; age range = 22-53 
years, mean age = 37.22 years, SD = 11.04): this subsample was 
made up of primary school teachers. In addition, if they proved to 
have more than 1 year of teaching experience within school insti- 
tutions, students attending the last year of the degree to become 
Primary school teachers were also included in this sub-sample. 
These teachers, employed in Italian private and public primary 
schools, were recruited by means of a letter asking them about 
their willingness to join the study. Students were selected among 
those who subscribed to a newsletter of a service of learning psy- 
chology which offers information and counseling to teachers of 
all grades. 

Students (N = 18; men = 7, women =11; age range = 19-32 
years, mean age = 22.83 years, SD = 3.17): this category com- 
prised university students attending different faculties in univer- 
sities located in Milan and was evenly divided between students of 
science and of arts courses. This specific sub-sample was consid- 
ered because, in his research, Mayer had used mostly university 
students as participants and had based his assumptions about 
the common-sense theory on data collected working with them. 
Thus it was considered interesting and appropriate to exam- 
ine and compare naive theories also from this particular group. 
Moreover, learners appear to make better decisions than teachers 
or instructional designers because they have a bigger stake in the 
educational outcome and intimate knowledge of their learning 
preferences (Niemiec et al., 1996). The students were recruited by 
means of a notice placed on noticeboards in the university cam- 
puses. The selected students had no previous knowledge either of 
learning or cognitive psychology, or of multimedia learning. 

(Control) adults (N = 16; men = 8, women = 8; age range = 

22- 61 years, mean age = 32.94 years, SD = 11.20): this group 
included adults with jobs that do not require specific multimedia 
competence or experience with reference to the multimedia mate- 
rial employed in the present study. More precisely, these jobs were: 
office worker (8), maintenance man (1), physician (1), nurse (1), 
engineer (5). They were recruited by means of a notice placed 
on noticeboards for the administrative and technical staff of the 
university campuses. 

PROCEDURE 

The procedure followed by the researcher while administering 
the interview was structured as follows. Each participant was 



individually interviewed. Before starting the interview, a brief 
explanation about the goals of the study was given to the partici- 
pant. It was explained that the study was aimed at investigating 
the text-picture relationship in primary school textbooks and 
that researchers were interested in exploring participants' percep- 
tions of this relationship. Interviewees were told explicitly that 
the questions were not aimed at any sort of evaluation of the 
participants and that, therefore, no answers could be considered 
either right or wrong. Then the interview procedure was briefly 
explained. 

Afterwards, as reported in the protocol description, data aimed 
at describing the participant were recorded. After these prelimi- 
nary phases, three requests were made. 

First request 

The interviewer presented the participant with the booklet. 
Before beginning the interview, he/she specified to the partici- 
pants that in the study <^picturesj?> referred to both illustrations 
and graphical elements such as schemata. The interviewee was 
then asked to examine the illustrations on each page and to 
answer two questions: 

^CWhat, in your opinion, is the purpose of each different 
picture?J?> 

<C Assign, for each illustration, a score between —2 and +2, 
referring to the level of usefulness/uselessness you ascribe to the 
picture^ 4 . 

The first question was aimed at outlining the subjects' naive 
conceptions concerning the functions of pictures expressed in 
terms of mental processes involved during learning. The second 
question was meant to highlight people's beliefs concerning the 
usefulness of the different typologies of illustrations by inducing 
them to match the mental process actually activated by pictures 
to the alleged optimal process. 

Second request 

In order to understand which naive theories concerning the 
efficacy of text-picture combination people tend to share, the 
interviewees were asked to indicate which illustrations they con- 
sidered to be the most and the least useful, after they had 
evaluated all the booklet's pages and seen the illustrations dur- 
ing the first part of the interview. Such a request involved a 
form of comparison with the naive conception resulting from 
the first request. Forcing a direct comparison between pictures 
and leading the respondents to divide the images into two dis- 
tinct categories, we tried to confirm, or disconfirm, the more 
general evaluation given in the first phase. In the second part 
of the first request, participants were asked to rate each illus- 
tration singly, stating the level of usefulness for each of them. 



4 Originally it had been proposed to use a five-point scale from 1-5. After an 
exploratory preliminary administration, participants were asked, at the end 
of the interview, for comments on the procedure. It was noticed that the 
interviewees who had difficulty rating the pictures were seemingly unable to 
calibrate their judgements. To facilitate this task the researchers decided to 
transpose the scale using values from —2 to +2, hypothesising that the differ- 
ence between positive and negative values could help the respondent. Another 
pilot phase confirmed this hypothesis. Therefore this version of the scale was 
maintained in the main study. 
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This second request was more general (asking the intervie- 
wees to give an overall evaluation of the usefulness/uselessness 
of the pictures). It was aimed at integrating the first evalua- 
tion by adding data coming from a more general evaluation. 
This second assessment was imagined to be more focused on 
general characteristics of the mental processes elicited by illus- 
trations and less on the specific processes elicited by single 
pictures. 

Third request 

The interviewee was asked to build his/her Perfect Book, using all 
or some of the provided material mentioned above. The text on 
photosynthesis was divided into two pages and the interviewee 
was free to place both the textual and visual information in the 
order and the amount he/she wished on the pages. The arrange- 
ment of illustrations on the page was also left to the interviewee, 
who was free to place the chosen pictures and the correspond- 
ing text where he/she believed them to be most useful, easily 
read, or best comprehended. The researcher then reproduced the 
final composition on the filing record. More precisely, he/she 
schematized the general structure (i.e., the arrangement of the 
elements) chosen by the participant and assigned a code (cor- 
responding to the category to which each picture belonged and 
a progressive number for each portion of text) to each picture 
and piece of text chosen by the interviewee. The goal of this 
task was to highlight the possible coherence of naive theories 
with Mayer's spatial contiguity principle, as well as provide a 
comparison with the statements of the first and second requests 
in order to assess the internal consistency of the respondents' 
beliefs. 

The following schema shows the correspondence between 
the research issues and the steps of the interview (X indicates 
that the question was considered in that phase of the interview). 

Research focus Requests to interviewees 

State the aim State the Build the 

of each picture most/least useful Perfect Book 
(first request) picture typology (third 

(second request) request) 

(1a). Description X 
of the nai've 
conception: 
functions of the 
pictures 



(1 b). Description 
of the nai've 
conception: utility 
of the pictures 


X 


X 


(2). Internal 


X 


X 


coherence of the 






nai've conception 






(3). Differences X 


X 


X 



among 
subsamples 
concerning nai've 
conception 



Two researchers carried out the interviews. Both have been 
trained to ask the questions and to record the answers according 
to a standard procedure. 

DATA ANALYSIS 

Overall design of the analyses carried out 

The collected data were presented in an order which mirrors the 
structure of the filing record explained previously and were anal- 
ysed according to the three main research issues mentioned at the 
end of the section "Objectives of the study" so as to answer the 
specific questions listed below: 

(1) Features of the conceptions about multimedia learning 

(la) Cognitive functions attributed to pictures 

• What mental functions do interviewees ascribe to 
different pictures? 

• Is the function classification identified by the partic- 
ipants similar to Mayer's? 

(lb) Perceived utility of pictures 

• Do utility scores differ according to pictures' cogni- 
tive functions? 

• Which pictures are perceived as the most and least 
useful? 

• Do the utility scores assigned by the participants 
match Mayer's model? 

(2) Internal coherence of the conceptions about multimedia 
learning 

• Is it possible to identify a general coherence among use- 
fulness scores and between the general scores attributed to 
pictures and the composition of the Perfect Book? 

(3) Differences among sub-samples concerning naive conception 
about multimedia learning 

• Does any significant difference emerge among sub- 
samples in function attribution? 

• Does any significant difference emerge among sub- 
samples in utility scores? 

Data encoding 

Opinions concerning the purpose of pictures (first request) were 
coded into 11 functional categories. The system of categories 
devised by Mayer (1993), with the extensions and modifications 
imposed in our study, appeared to be a good point from which 
to begin classifying the respondents' beliefs about the function of 
each picture in promoting pupils' learning. However, an initial 
examination of the interviewees' answers suggested that func- 
tions not included or foreseen in Mayer's system had emerged. 
Thus, some additional categories were needed. Distinctive char- 
acteristics expressed by the interviewees, and not corresponding 
to the elements of the functional categories previously considered, 
were registered, examined, and organized into four new categories 
that had not been considered by Mayer. Categories 1-7 corre- 
spond to our original classification derived from Mayer's own 
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classification, whereas categories 8-1 1 are based on the additional 
data gathered regarding participants' beliefs 5 . 
The four additional categories are as follows: 

• Contextualizing (C): pictures which, at various levels, introduce 
the main topic. These help the student to link the text con- 
tent to his/her everyday life, or to make some unknown aspect 
of experience more familiar (for example, the photo of a girl 
eating an ice-cream next to a text explaining the taste buds). 
Illustrations that characterize or identify a specific subject or 
activity can also be related to this category. 

• Motivational (M): illustrations which excite curiosity or elicit 
emotions and attract the student's attention so as to induce 
him/her to read the text or perform the required activ- 
ity. Images which stimulate the identification of the student, 
involving him/her from an emotional point of view are also 
included in this category (for example, cartoon characters or 
cartoons put beside a maths exercise or explanation). 

• Operational (OP): pictures which are an essential and needful 
part of an exercise or activity. In this case it is not the iconic 
typology that has to be taken into account, but rather the final 
purpose of the illustration within the context of a specific activ- 
ity (for example, illustrations of the single parts of a practical 
activity to be performed by students). The interviewee, when 
referring to this category, stressed its relevance to the activity 
of which it is a part. 

• Mnemonic (MN): pictures which allow students to activate 
memorization strategies by using mental visualization (or 
more general visual strategies) or by using concrete objects as 
an aid in maths exercises. 

This system of categories — as well the distinctions between Dl, 
D2, Rl, R2, O, and E pictures — was converted into a set of unam- 
biguous operational definitions, expressed through conditional 
clauses, hierarchically organized in a decision tree so as to allows 
encoders to apply the same classification criteria. 

The coding procedure was carried out by two independent 
judges, who were trained to apply the above mentioned categories 
to the interviewee's answers. All responses had been read by the 
two judges and assigned to different categories. Responses were 
then re-examined and the assignment to the specific categories 
had to be confirmed. The agreement between the two judges was 
98%. Ambiguous cases were evaluated by the two judges together 
in order to reach a unanimous decision. 

RESULTS 

FEATURES OF THE CONCEPTIONS ABOUT MULTIMEDIA LEARNING 

Pictures ' functions 

Analyses were carried out separately considering textbooks 
of each grade of the primary school and the distinction 
between reading and disciplinary textbooks. Since the pictures 
that emerged overlapped, we collapsed data and so we will 



Two other categories (0 and 12) were coded. They indicate, respectively, a 
picture that was not considered and an explicit statement of the considered 
illustration's lack of function. 



present results concerning the whole set of textbooks con- 
sidered without distinguishing among grades and kinds of 
books. 

Table 3 reports the frequencies and percentages of partici- 
pants' function attribution. The types of functions ascribed to the 
illustrations by the interviewees were compared with Mayer's clas- 
sification in order to highlight correspondences and discrepan- 
cies. The columns show how the pictures were grouped according 
to Mayer's classification and the percentages of these groupings, 
whereas the rows show the same pictures grouped according to 
interviewees' attributed functions. For example, 59 pictures clas- 
sified as Dl according to Mayer's classification were considered as 
Decorative by participants. 

The gray cells in the table denote where the majority of choices 
would be grouped if participants' categories completely corre- 
sponded with Mayer's classification. The bold emphasis indicates 
where the highest frequencies occurred. Z-value tells us how 
many standard deviations above or below the mean our value x 
is: positive z-values are above the mean, whereas negative z- values 
are below the mean. 

A log-linear analysis was performed on the frequency data in 
Table 3. Given the specific data and the number of variables, we 
decided to test the data using a log-linear analysis, since One-Way 
ANOVAs and chi-square tests would not have allowed us to con- 
sider the interactions among the different variables. The analysis 
resulted in a hierarchical log-linear model, where the interac- 
tion between the variables was significant [X(6n) = 5104.92, p < 
0.001]. As it appears from the data reported in Table 3, Rl, E, 
and O were categories where a good correspondence with Mayer's 
functional classification can be found. The R2 pictures were quite 
correctly detected by many participants, as were the OP ones. 
This can probably be ascribed to the massive use of R2 pic- 
tures (those which, as explained above, integrate or stand in for 
the text) to complete, describe, or enrich exercises or practical 
applications. 

The remaining categories (Dl and D2) were classified by par- 
ticipants in a different way. More precisely, D 1 pictures were, on 
the whole, not considered by participants. Those who actually 
considered them did not see them as decorative, but ascribed to 
them a motivational function. D2 pictures were more frequently 
considered by participants, who, however, did not seem to have 
a clear idea of their function. The majority (which, nonethe- 
less, coincides with only 17.65% of the answers) saw them as 
C pictures, most likely because several D2 pictures were pho- 
tos that emphasized the link with concrete experience. Many 
participants saw these pictures also as Representative because 
of a weak connection with the content of the corresponding 
text. 

Rl pictures were mostly ascribed to the MN category. This 
attribution is due to the fact that many pictures related to this 
category were used to represent texts of mathematical problems. 
Therefore, they might appear to the participants as providing 
suitable mental strategies to better comprehend and solve prob- 
lems. It is worthwhile noting that 16.6% of interviewees classified 
the Rl pictures as R2 ones, implying that participants tended 
to attribute a greater function to the role of the picture than it 
actually had. 
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Table 3 | Frequencies and percentages of function attributions. 



D1 



Functional categories 



D2 



R1 



R2 



Not considered 



D1 



D2 



263 (z = 15.36) 
42.21% 

59 (z = 8.69) 
9.47% 

28 (z = 6.48) 
4.49% 

3 

0.48% 



124 (z=2.79) 
15.20% 

103 (z = 9.19) 
12.62% 

34 (z = 5.32) 
4.17% 

24 (z= 5.80) 
2.94% 



26 (z= -9.25) 
3.35% 

55 (2=3.15) 
708% 

11 

1.42% 
11 

1.42% 



46 (z = 
4.23% 

22 

2.02% 
4 

0.37% 
3 

0.28% 



-6.17) 



228 (z= 7.07) 
15.44% 

3 (z = -4.42) 
0.20% 

0 (z = -2.06) 

0% 

1 

0.07% 



37 (z = -3.54) 
2.89% 

5 (z = -3.46) 
0.39% 

0 

0% 
2 

0.16% 



12 (z = 
1.93% 



-4.65) 



727 (z = 4.93) 
15.56% 



306 (z= 12.48) 
29.06% 



110 (z = 2.17) 
10.11% 



44 (z = -2.50) 
2.98% 



35 (z = -2.43) 
2.73% 



R2 



9(z=- 
1.44% 



■5.62) 



83 

10.17% 



129 

16.60% 



224 (z =8.17) 
20.59% 



96 

6.50% 



77 

6.02% 



1 (z=- 
0.16% 



■3.09) 



3 (z = -4.07) 
0.37% 



9 (z = -3.58) 
1.16% 



20 

1.84% 



392 (z= 14.65) 
26.54% 



121 (z = 9.77) 
9.45% 



Contextualization 



Motivational 



Operational 



Visual 



9(z=- 
1.44% 

55 (2 = 
8.83% 

106 (z = 
17.01% 

15 (z = - 
2.41 % 

5(z=- 
0.80% 



-6.14) 



2.90) 



11.32) 



■2.49) 



•5.50) 



26 (2 = -7.26) 
3.19% 

144 (z = 6.65) 
17.65% 

66 (2 = 2.79) 
8.09% 

2 (z = -6.23) 
0.24% 

27 (z = -5.17) 
3.31% 



92 (z=-2.55) 
11.84% 

76 

9.78% 

88 (z = 3.22) 
11.33% 

60 

772% 

158(z = 4.94) 
20.33% 



95 

8.73% 

135 (2 = 4.55) 
12.41% 

29 (2 = -3.65) 
2.66% 

224 (z = 9.20) 
20.59% 

141 (z = 4.59) 
12.96% 



263 (z = 7.67) 
17.81% 

34 (z = -3.62) 
2.30% 

16 (2= -4.21) 
1.08% 

215 (z = 8.59) 
14.56% 

146 (2 = 5.27) 
9.89% 



688 (z= 17.18) 
53.75% 

24 (z= -3.95) 
1.86% 

17 (2 = -3.07) 
1.33% 

124 (z = 6.58) 
9.69% 

125 (z = 5.80) 
9.77% 



No function 



58 (z = 
9.31 % 



6.62) 



53 

6.49% 



32 (2= -3.56) 
4.12% 



35 (2= -2.33) 
3.22% 



39 

2.64% 



25 

1.95% 



z values between 1.65 and 2.32: p < 0.05; z 
3.11: p< 0.001. 



values between 2.33 and 2.57: p < 0.07; z values between 2.58 and 3. 10: p < 0.005; z valueas equal or higher than 



Pictures' utility 

Utility scores: For each illustration, participants were asked to 
assign a score between —2 and +2, according to their perception 
of the picture's level of usefulness/uselessness. To analyse the 
mean utility scores assigned to the different picture categories, the 
scores were converted into a 1 to 5-point scale. The overall scores 
have been computed and the mean and standard deviation for 
each category have been calculated (Table 4). The AN OVA (com- 
puted by considering the categories as the independent variable) 
showed that utility scores differed significantly among categories, 
F(5, 402) = 165.51; p < 0.0001. Post-hoc Turkey tests showed that, 
excepting the Rl and R2 pictures, all the differences among 
picture categories were statistically significant. This led us to con- 
clude that interviewees were able to discriminate among picture 
categories while rating them. 

From data reported in Table 4, it can be seen that Dl pic- 
tures were considered to be less useful, whereas the E pictures 



Table 4 | Means and SDs of utility scores attributed to different 
picture categories. 



Category 


Mean 


SD 


D1 


1.90 


0.74 


D2 


3.01 


0.56 


fl1 


3.96 


0.46 


R2 


3.99 


0.49 


O 


3.66 


0.57 


E 


4.26 


0.52 



were perceived as being the most useful. Dl pictures had the 
highest standard deviation: this can be explained by the fact that 
most subjects tended either to ignore such pictures or to ascribe 
them a positive score when actually considering them. D2 pictures 
collected a much higher utility score compared to Dl pictures, 
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maybe because our interviewees focused more on the possible 
connection with the corresponding text. O pictures were rated 
lower than R pictures: maybe this lower rate was due to the fact 
that many O images, being similar to schemata and/or graphs, 
were not recognized as proper illustrations by many participants 
(actually O pictures had the highest SD after the Dl ones). Yet 
those who considered them rated them highly. 

Most and least useful images: In the second phase of the inter- 
view participants were asked to indicate the pictures which they 
considered to be the most and least useful. Data resulting from the 
analysis of interviewees' responses seemed to be consistent with 
the utility scores assigned to each picture and described above. 
Considering this second request as a direct confirmation of the 
first request, we decided to focus on the participants' attribution 
of functions. 

The pictures defined as the most useful (Table 5) are the E pic- 
tures. High percentages were also found for the O, C, and MN 
pictures. None of the interviewees designated the Decorative pic- 
tures as the most useful typology. Differences in the distribution 
of the illustrations mentioned as the most useful were significant 
[^ 8 , N = 68 )= 44.5,p< 0.001]. 

The majority of the interviewees indicated the Decorative 
pictures as being the least useful (Table 5). Other categories men- 
tioned in response to this question included pictures defined as 
"useless" from the outset and those not considered as fulfilling 
their function (e.g., decorative pictures which do not properly 
represent the text referred to) [x?i3 n=68) = 97.94, p < 0.001]. 
Respondents did not select pictures categorized as E to answer 
this question. 

INTERNAL COHERENCE OF NAIVE CONCEPTION ABOUT MULTIMEDIA 
LEARNING 

Almost all the participants (91.2%) used both pages to com- 
pose the Perfect Book. The number of images used was, for most 
interviewees, about half of the provided illustrations: 23.5% of 
the participants used 7 pictures out of the 15 total available, 
17.6% used 6 pictures, and another 17.6% used 8 of the pro- 
posed images. Only 13.2% used 4 pictures, while 2.9% used 9. 
The main trend was to use 2 O (72.1%), 1 (41.2%) or 2 (50%) E 
and R (1 picture = 36.8%, 2 pictures = 39.7%) and one (30.9%) 
or no (41.2%) D illustration. All participants observed the spatial 
contiguity principle. 

A strong coherence between the perceived utility among 
the three requests emerged and is summarized in Table 6. 
Considering the different requests, it was not possible to match 
answers and requests perfectly. All of the answers were split into 
three principal categories: High utility (corresponding to those 
pictures used most often to compose the Perfect Book, those pic- 
tures rated as most useful, and those categories that gained the 
highest mean of utility scores); Medium utility (corresponding 
to those pictures used less often than others to create the Perfect 
Book and those categories that gained a medium mean of utility 
scores); Low utility (corresponding to those pictures used the least 
to make up the Perfect Book, the pictures rated as least useful, and 
those categories that gained the lowest mean of utility scores). 

What emerges is that there is not only a strong coherence 
among the three requests, but also a generalized model of picture 



Table 5 | Frequency of pictures rated as most and least useful. 



Most useful images 


Frequency 


Percentage 


No image 


1 


1.5 


Representational 1 


7 


10.3 


Representational 2 


3 


4.4 


Organizational 


10 


14.7 


Explicative 


23 


33.8 


Contextualization 


9 


13.2 


Emotional-motivational 


4 


5.9 


Operational 


4 


5.9 


Visual-mnemonical 


7 


10.3 


Total 


68 


100.0 


X?o u cm = 44.5, p < 0.001 

A (8, N = 68) ' " 






Least useful images 


Frequency 


Percentage 


-Emotional-motivational* 


1 


1.5 


-Contextualization* 


1 


1.5 


-Explicative* 


1 


1.5 


-Organizational* 


1 


1.5 


-Representational 1 * 


5 


7.4 


No image 


3 


4.4 


Decorative 


16 


23.5 


Decorative 1 


20 


29.4 


Decorative 2 


3 


4.4 


Representational 1 


4 


5.9 


Organizational 


1 


1.5 


Contextualization 


2 


2.9 


Emotional-motivational 


1 


1.5 


Images with no function 


9 


13.2 


Total 


68 


100.0 



X03,N = 68)= 97 - 94 .P< 0 - 001 



"In this table, categories preceded by the sign "-" refer to the non-achievement 
of the function. For example, "- Representational 1 " refers to a missed 
Representational picture of first level, namely, a Representational picture of first 
level which is not able to adequately represent the text it refers to. 

utility, which mirrors Mayer's classification. The fact that all the 
participants spontaneously observed the spatial contiguity prin- 
ciple while composing the Perfect Book leads us to think that this 
principle, too, was implicitly perceived as valid for our sample. 

DIFFERENCES AMONG SUB-SAMPLES CONCERNING NAIVE 
CONCEPTION ABOUT MULTIMEDIA LEARNING 

Pictures' functions 

Frequencies and percentages of category attribution for each 
group of participants are reported in Tables 7-10. Mayer's func- 
tional categories were compared with the categories ascribed by 
participants (as was previously done for the whole sample) to 
highlight correspondences and discrepancies between the two 
positions. Gray cells denote where the majority of the choices 
would be grouped, should participants' categories completely cor- 
respond with Mayer's categorization. Bold emphasis indicates 
where the majority is actually grouped. 

A log-linear analysis was performed on the frequency data 
reported in the four tables. In all cases the analysis specified a 
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hierarchical log-linear model, where the interaction between the 
variables was significant [Table 7: X. 2 (60) = 1091.69, p < 0.001; 
Table 8: x 2 (60) = 1490.96, p < 0.001; Table 9: x 2 (60) = 1769.28, 
p < 0.001; Table 10: x 2 (60) = 1328.55, p < 0.001]. 

The groups did not differ greatly from the general tendency of 
the whole sample. Nevertheless some differences, which explain 
the results of the log-linear analyses, can be noted. All the four 
groups generally tended not to consider Dl pictures. However, 



whereas teachers, illustrators, and control participants had a high 
score in the M category, students were the only group who 
appeared to prefer the Decorative category (18.89% of students 
described as Decorative pictures that were not even consid- 
ered by 38.33% of other participants). This was consistent with 
Mayer's categorization, since our sub-samples tended not to con- 
sider those pictures that, according to Mayer, have no cognitive 
function. 



Table 6 | Coherence among the three requests regarding picture utility. 

Perfect Book Most useful pictures Least useful pictures Means of utility scores 



High utility 
Medium utility 

Low utility 



Organizational, Explanative 
Representational 

Decorative 



Organizational, Explanative 



Decorative 



Organizational, Explicative 
Representational first level, Representational 
second level 

Decorative first level, Decorative second 
level 



Table 7 | Frequencies and percentages of category attribution for illustrators. 



Functional categories 



D1 



D2 



R1 



R2 



Not considered 



59 (z = 7.28) 
37.34% 



19 

9.90% 



6 (z = -3.60) 
2.46% 



7 (z = -3.61) 
2.74% 



59 (z = 4.74) 
16.76% 



12 (z= -2.04) 
3.95% 



13 (z = 3.81) 
8.23% 



22 (z= 3.95) 
11.46% 



3.29% 



6 

2.35% 



0 (z = -2.20) 
0% 



2 

0.66% 



D1 



D2 



m 



R2 



10 (z = 4.35) 

6.33% 

1 

0.63% 

4 (2 = -2.35) 
2.53% 

2 (2 = -3.17) 
1.27% 

0 

0% 



8 (z= 2.45) 
4.17% 

9 (z = 3.96) 
4.69% 

28 

14.58% 
20 

10.42% 

1 (z = -2.06) 
0.52% 



3 

1.23% 
0 

0% 

79 (z = 6.72) 
32.38% 

35 

14.34% 
4 

1.64% 



2 

0.78% 
0 

0% 
30 

11.76% 



4 

1.57% 



0 

0% 
0 

0% 

8 (z = -2.66) 
2.27% 

35 

9.94% 

60 (z = 7.02) 
17.04% 



0 

0% 
0 

0% 
16 

5.26% 
22 

7.24% 

19 (z = 4.31) 
6.25% 



10 (z = -2.46) 
6.33% 



9 (z = -5.28) 
4.69% 



33 

13.52% 



34 

13.33% 



95 (z = 4.37) 
26.99% 



171 (z= 9.27) 
56.25% 



Contextualization 



18(z = 3.81) 
11.39% 



37 (z = 4.36) 
19.27% 



14 

5.74% 



27 (z = 2.43) 
10.59% 



5 (z = -2.06) 
1.42% 



1 (z= -2.75) 
0.33% 



Motivational 



25 (z = 7.14) 



18 (z = 2.95) 





15.82% 


9.37% 




1.64% 


2.35% 


0.85% 


0.66% 


Operational 


0 (z= -2.21) 


1 (2=- 


-2.96) 


12 


40 (z = 3.87) 


41 (2 = 4.25) 


39 (z=5.10) 




0% 


0.52% 




4.92% 


15.68% 


11.64% 


12.83% 


Visual 


1 (2= -2.03) 


3(2=- 


-3.14) 


32 (2=3.01) 


29 (z = 2.44) 


28 (z = 2.33) 


17 (z=2.65) 




0.63% 


1.56% 




13.11% 


11.37% 


7.95% 


5.59% 


No function 


15(2=4.01) 


17 




14 


17 


18 


3 




9.49% 


8.85% 




5.74% 


6.67% 


5.11% 


0.99% 
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Table 8 | Frequencies and percentages of category attribution for teachers. 



D1 



Functional categories 



D2 



R1 



R2 



Not considered 



77 (z = 9.06) 
42.78% 



41 (z= 2.74) 
18.98% 



7 (z = -5.07) 
2.46% 



12 (z= -3.16) 
4.17% 



59 (z = 3.58) 
14.90% 



8 (z = -2.82) 
2.34% 



22 (z= 5.62) 
12.22% 



30 (z= 5.62) 
13.89% 



14 

4.91 % 



7 

2.43% 



3 

0.76% 



0 (z = -2.32) 
0% 



D1 



5 (z = 3.49) 
2.77% 



7 (z = 2.75) 
3.89% 



4 

1.40% 



0.35% 



0 

0% 



0 

0% 



D2 



2 

1.11% 



5 (z = 2.00) 
2.31 % 



5 

1.75% 



0.35% 



0 

0% 



1 

0.29% 



m 



4 

2.22% 



29 (z= 2.00) 
13.43% 



77 (z = 6.04) 
27.02% 



20 

6.94% 



19 

4.80% 



9 

2.63% 



R2 



Contextualization 



Motivational 



4(z = -2.18) 
2.22% 

0 

0% 

1 (z = -2.75) 
0.56% 



6.11% 

42 (z= 7.14) 
23.33% 



23 

10.65% 

0 (2= -2.05) 
0% 

5 (z = -3.44) 
2.31 % 

33 (z = 4.24) 
15.28 

25 

11.57% 



35 

12.28% 
2 

0.70% 
32 

12.23% 
25 

8.77% 
36 

12.63% 



79 (z= 5.88) 
27.43% 



1.39% 
22 

7.64% 

37 (z = 4.23) 
12.85% 

12 

4.17% 



20 

5.05% 

113 (z = 7.94) 
28.53% 

52 U=3.25) 
13.13% 

10 

2.52% 

4(z = -3.83) 
1.01 % 



28 

8.19% 

38 (z= 5.65) 
11.11% 

179 (z= 10.26) 
52.34% 

4(z = -1.98) 
1.17% 

10 

2.92% 



Operational 



1 (z = -3.33) 
0.46% 



10 (z = -2.03) 
3.51 % 



64 (z = 6.30) 
22.22% 



72 (z = 2.40) 
18.18% 



22 (z= 2.40) 
6.43% 



Visual 



0 (z = -2.23) 
0% 



7 

3.24% 



28 

9.82% 



26 

9.03% 



38 (z= 3.81) 
9.60% 



36 (z= 3.81) 
10.53% 



No function 



10 (z = 3.36) 
5.56% 



10 

4.63% 



10 

3.51 % 



3 (z = -2.23) 
1.04% 



6 

1.51% 



7 

2.05% 



Also D2 pictures were considered differently by the four 
groups. Teachers tended not to consider them, equating them 
with the Dl pictures. Illustrators saw them mainly as C pic- 
tures (19.27%). It is worthwhile noting that ascribing Decorative 
pictures to this category was a common tendency of the whole 
sample, even though the other subsamples did not use this cate- 
gory as often as the illustrators did. Students and control group 
participants tended to mistake Decorative pictures for Rl images. 

Rl pictures tended to be correctly detected by all groups. The 
teacher group was once again the only one that regularly hypoth- 
esized an emotional-motivation function. For the other groups, 
the MN category appeared to be one of the most used. 

R2 images tended to be correctly detected by all groups. Since 
many pictures in this category were used to illustrate exercises, 
the OP category was also often used by participants. Illustrators, 
unlike other participants, tended to mistake R2 images for E pic- 
tures (13.33%), showing that these individuals sometime overrate 
the cognitive function of such images. The control group showed 



a tendency to positively evaluate the visual impact of those pic- 
tures that often stand in for the text — and this is proved by their 
rating R2 images as C (14.09%) or MN pictures (14.45%). 

O pictures were correctly categorized by all groups except for 
the illustrators, who often mistook them for E ones. Thus, the ten- 
dency of this subsample to overrate the cognitive function of the 
pictures was confirmed once more. It is also interesting to note 
that all the participants, apart from the students, tended (when 
not assigning them to the right category) not to consider the O 
pictures as proper illustrations. This can be explained by the fact 
that many O pictures were schemata or graphs, and thus possibly 
more familiar to students than to other groups. Teachers and stu- 
dents, who are more used to employing the kind of illustrations 
analysed in the present study for their practical work than the 
other participant categories, also used the OP category to describe 
those pictures. 

E pictures seemed to be the most easily and univocally recog- 
nized, gaining more than 50% identification in every group. 
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Table 9 | Frequencies and percentages of category attribution for students. 



D1 



Functional categories 



D2 



/?1 



R2 



Not considered 



69 (z = 8.09) 
38.33% 



33 (z = 2.24) 
15.28% 



6 (z = -4.51) 
2.10% 



9(z = -3.18) 
3.12% 



50 (z = 3.56) 
10.27% 



8 (z = -2.34) 
2.44% 



34 (z = 6.39) 
18.89% 



31 (z = 4.48) 
14.35% 



19 (z= 1.99) 
6.64% 



7 

2.43% 



0 (z = -2.52) 

0% 



1 (z = -2.09) 
0.31 % 



D1 



12 (z = 4.97) 
6.67% 



13 (z= 3.94) 
6.02% 



4 

1.40% 



0 

0% 



0 

0% 



0 

0% 



D2 



3 

1.67% 



8 (z= 2.53) 
3.70% 



6 

2.10% 



2 

0.69% 



1 

0.20% 



1 

0.31 % 



fll 



0 (z = -2.36) 

0% 



41 (z = 3.89) 
18.98% 



82 (z = 5.64) 
28.67% 



28 (z= 2.04) 
9.72% 



12 

2.46% 



6 

1.83% 



HI 



Contextualization 



4 (z = -2.26) 
2.22% 

0 

0% 

1 (?= -2.81) 
0.56% 

13 



8.33% 

0(z = 
0% 



-2.08) 



7 (z = -2.52) 
3.24% 

32 (z = 2.87) 



30 

10.49% 
1 

0.35% 
16 

5.59% 

13 (z = 2.33) 



60 (z = 4.44) 
20.83% 

6 

2.08% 
16 

5.56% 

34 (z= 2.50) 



25 

5.13% 

200 (z = 9.29) 
41.07% 

59 (?= 34.41) 
12.11% 

14 



23 

7.03% 

36 (z = 5.35) 
11.01% 

168 (z= 10.88) 
51.38% 

7 (z= -2.11) 





722% 


14.81% 


4.54% 


11.81% 


2.87% 


2.14% 


Motivational 


21 (z = 4.99) 


16 


28 (z = 3.19) 


6 


2 (z = -2.84) 


4 




11.67% 


7.41 % 


2.79% 


2.08% 


0.41 % 


1.22% 


Operational 


3 


0 (z = -3.11) 


28 (z = -2.25) 


82 (z = 5.34) 


71 (z = 4.72) 


37 (z = 3.40) 




1.67% 


0% 


9.79% 


28.47% 


14.58% 


11.31% 


Visual 


0(z=-2.12) 


7 


48 (z = 3.42) 


30 


49 (z = 3.32) 


33 (z = 3.93) 




0% 


3.24% 


16.78% 


10.42% 


10.06% 


10.09% 


No function 


20 (z = 3.01) 


10 


5 


8 


4 


3 




11.11% 


4.63% 


1.75% 


2.78% 


0.82% 


0.92% 



Even if strong differences among subsamples failed to emerge, 
it is possible, nonetheless, to locate some tendencies. In general, 
whereas illustrators and teachers tended to have differing opin- 
ions, students and the control group often seemed to converge 
in their evaluations/interpretations. These last two subsamples 
tended to agree with Mayer's theories more frequently than did 
the others. Moreover, illustrators tended to overrate the cogni- 
tive function of all pictures, whereas teachers tended to stress the 
Motivational functions of the Decorative and Representational 
pictures. 

Pictures' utility 

A 6 x 4 MANOVA was conducted to explore the differences 
among the four subsamples in the attribution of utility scores 
(Table 11). The mean score attributed to D2 pictures was higher 
for illustrators and this difference was significant: F( 3 64 ) = 2.89, 
p < 0.05. The mean score attributed to Organizational illus- 
trations was higher for students and this difference was also 



significant: F($ t 64) = 3.27, p < 0.05. A posteriori Turkey tests 
showed that the mean scores attributed to O pictures differed sig- 
nificantly between illustrators and students (p < 0.05). No other 
difference was statistically significant. 

DISCUSSION AND CONCLUSIONS 

Up to now we have no empirical data about the conceptions that 
people possess about text-picture combinations devised for edu- 
cational purposes. The present study was carried out to fill such 
a lack of data by trying to thoroughly investigate people's beliefs 
regarding the role and effectiveness of illustrations in children's 
textbooks. Our aim was to record participants' opinions by asking 
them to consider concrete cases. An open interview, based on spe- 
cific pictures taken from actual textbooks, secured more trustwor- 
thy data than that obtained with less ecological methodologies, 
also due to a greater ecological validity of the employed materials. 
Considering the fact that beliefs are often not available as abstract 
conceptualizations, we did not ask participants to provide only 
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Table 10 | Frequencies and percentages of category attribution for the control group. 



D1 



Functional categories 



D2 



/?1 



R2 



Not considered 



79 (z = 7.81) 
49.37% 



31 

13.60% 



8 (z = -3.34) 
3.32% 



18 

7.03% 



60 (z = 3.41) 
16.57% 



9 (z = -2.66) 
2.97% 



5 (z = 2.12) 
3.12% 



22 (z = 4.72) 
9.65% 



7 

2.90% 



2 

0.78% 



0 

0% 



0 

0% 



D1 



2 

1.25% 



6 (z = 3.04) 
2.64% 



0 

0% 



0 

0% 



0 

0% 



0 

0% 



D2 



0 

0% 



0 

0% 



0 

0% 



0 

0% 



0 

0% 



m 



4 

2.50% 



59 (z = 6.12) 
25.88% 



58 (z = 6.04) 
24.07% 



16 

6.25% 



5 (z = -2.56) 
1.38% 



3 (z = -2.61) 
0.99% 



R2 



Contextualization 



Motivational 



6 (z = -2.40) 
3.75% 

1 

0.62% 

5 (2= -2.79) 
3.12% 

17 

10.62% 

20 (z = 3.86) 
12.50% 



24 

10.53% 
2 

0.88% 

5 (z = -4.43) 
1.74% 

42 (z = 2.74) 
18.42% 

9 

3.95% 



39 (z = 2.39) 
16.18% 

0 

0% 
24 

9.96% 
29 

12.03% 
16 

6.64% 



68 (z = 4.70) 
25.56% 

5 

1.95% 
19 

7.42% 
36 

14.06% 
11 

4.30% 



16 

4.42% 

114 (z = 8.47) 
31.49% 

61 (z = 3.67) 
16.85% 

7 (z= -3.32) 
1.93% 

6 

1.66% 



16 

5.28% 

28 (z = 4.79) 
2.24% 

170 (z = 9.72) 
56.11% 



3.63% 



2.64% 



Operational 



1 

0.62% 



0 (z = -2.53) 
0% 



9 (z = -2.20) 
3.73% 



37 (z = 4.68) 
14.45% 



31 (z = 4.06) 
8.56% 



3.63% 



Visual 



3 (z = -3.42) 
1.87% 



10 (z = -3.36) 
4.39% 



40(z = 2.41) 
16.60% 



37 

14.45% 



51 (z = 3.25) 
14.09% 



35 (z= 2.64) 
11.55% 



No function 



17 (z = 2.01) 



16 



7 (z = -3.02) 



verbal answers, but also requested a response through practical 
actions, such as selecting pictures and positioning them to create 
an original product. 

Three main issues were addressed in the study: (1) the descrip- 
tion of the conception about multimedia learning which emerged 
from participants' responses, (2) the internal consistency of such 
conception, and (3) the possible differences in such conception 
depending on the level of the respondents' expertise. 

FEATURES OF THE CONCEPTION ABOUT MULTIMEDIA LEARNING 

As far as the first issue is concerned, participants' responses 
allowed us to extend the frame in which beliefs about multimedia 
learning can be conceptualized. More precisely, the study led us to 
identify new categories, to use alongside those devised by previ- 
ous authors, in which pictures can be classified according to their 
functional role described in terms of the mental processes which 
the text-picture combination activates. As a result, a comprehen- 
sive and systematic set of kinds of illustrations, each associated 
with the corresponding cognitive mechanism, is proposed. 



Two categories proved to be the most interesting: on the one 
hand, the Motivational category, and, on the other hand, the 
Mnemonic. The first one converges with a cultural-pedagogical 
position (Korat, 2001), which ascribes importance to a strong 
link between social-emotive aspects (self-identity, use of narra- 
tives linked to a child's life, social interaction) and cognitive 
skills to promote a child's literacy. This correspondence might 
mean that "non-expert" people have deep-rooted naive theo- 
ries about themes related to cognitive psychology, and that these 
theories — whereas diverging from those accredited by cognitive 
psychology — have a rationale of their own. The Mnemonic cat- 
egory, on the other hand, has an iconic function, known as the 
transformation function, which was identified by Levin (1983), 
but later disregarded by Mayer (1993). According to Levin, pic- 
tures related to this category are designed to facilitate the mem- 
orization of the text to which they refer. The visual recoding 
of the information allows the reader to spontaneously use some 
sort of visual mnemonic technique. Pictures focus the atten- 
tion on the main points of the content to be acquired, thus 
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Table 11 | Mean scores and SDs in the different subsamples. 





o uuaai iiuic 


Mean 


SD 


Decorative 1 


Teachers 


1.94 


0.17 




Illustrators 


2.21 


0.18 




Students 


1.69 


0.17 




v u 1 1 ii \J\ 


1 75 


0 18 


Decorative 2 


Teachers 


2.87 


0.13 




Illustrators 


3.34 


0.13 




Students 


2.87 


0.13 




r\r\trr\\ 

V U I I LI tJI 


3 05 


0 13 


Representational 1 


Teachers 


3.98 


0.11 




Illustrators 


3.95 


0.12 




Students 


3.95 


0.11 




r\r\trr\\ 

I^UI I II \J\ 


3 96 


0 12 


Representational 2 


Teachers 


4.01 


0.11 




Illustrators 


3.85 


0.12 




Students 


4.13 


0.11 




Control 




U. I Z 


Organizational 


Teachers 


3.76 


0.13 




Illustrators 


3.40 


0.14 




Students 


3.92 


0.13 




Control 


3.51 


0.14 


Explanative 


Teachers 


4.21 


0.12 




Illustrators 


4.12 


0.13 




Students 


4.42 


0.12 




Control 


4.28 


0.13 



providing people with methodical ways to memorize data and 
making the data more concrete and storable. Levin (1983) also 
claimed that such pictures link different and distinct parts of 
information, forming a well-organized global context. This last 
description seems to be suitable for those Organizational pictures 
(such as geographical maps or hierarchical structures), which 
many interviewees put into the Mnemonic category. The same 
definitions employed by the interviewees to denote this cate- 
gory recall the key words used by Levin (1983). For example: 
<CThe picture promotes the visualization^, or <^This picture 
allows the learner to recall the corresponding textJS>, or even 
<^The illustration gives a prompt idea of the content and helps to 
memorize it^>. 

The overall view which emerged allowed us to draw a com- 
plete taxonomy of the functions that pictures play in multime- 
dia instructional tools supported by empirical evidence. Such a 
taxonomy can be described by highlighting the cognitive pro- 
cesses which underpins such functions. The taxonomy can be 
conceived as hierarchically organized, moving from the anal- 
ysis of what illustrations allow learners to represent in their 
mind (something which is mentioned versus not-mentioned in 
the text) to the features of such a representation (restricted to 
the visualization of a single element versus including also rela- 
tionships with other elements) and, finally, to the goal of such 
representation (just displaying the whole system of elements 



mentioned in the text versus explaining the dynamics of such 
a system). 

INTERNAL COHERENCE OF THE CONCEPTION ABOUT MULTIMEDIA 
LEARNING 

Participants' responses to different requests concerning the func- 
tions and utilities of pictures were always coherent, emphasizing a 
systematic nature of their beliefs. In particular, a strong coherence 
between the perceived utility among the three requests emerged. 
The equal use of Representational and Motivational pictures in 
building the Perfect Book mirrored the positive judgment peo- 
ple ascribed to them and the ease with which the participants 
identified the pictures' role in education. 

Thus, we can argue that the beliefs concerning multimedia 
learning are well-structured — so as to constitute a sort of naive 
theory in which opinions about the mental processes elicited by 
text-picture combinations are consistently associated with opin- 
ions regarding the cognitive functions they play and their efficacy 
within the general economy of mental work implied in learning. 

This coherent view developed by non-expert persons is not 
so far from Mayer's theory. According to this author, laypersons 
develop beliefs about the mechanisms of multimedia learning 
(the so-called common sense theory), which conflicts with what 
is supported by experimental research. According to the com- 
mon sense theory of multimedia learning, pictures are included 
in instructional materials just to convey information and they 
have no added value. In contrast to this view, our data supports 
the notion that people (i) identify further, and deeper, func- 
tions played by illustrations, (ii) associate these functions with the 
specific underlying cognitive processes (iii) which are meant to 
be addressed to specific goals (expressed in psychological terms: 
amusing, memorizing, understanding, and so on), (iv) so as to 
be able to judge properly how useful pictures can be in order to 
enhance or impede learning. 

DIFFERENCES AMONG SUBSAMPLES CONCERNING THE CONCEPTION 
ABOUT MULTIMEDIA LEARNING 

Strong differences between the interpretations of multimedia 
experts and non-experts failed to emerge, thus supporting the 
notion that folk ideas concerning multimedia learning are quite 
generally shared and robust. It was possible, nonetheless, to high- 
light some significant differences. In general, while illustrators 
and teachers tended to have differing opinions, students and the 
control group often seemed to converge. 

These last two subsamples tended - more frequently than the 
others - to agree with Mayer's theory. This data seems to be par- 
ticularly relevant. Mayer, in fact, predominantly employed univer- 
sity students as his subjects. From our study the idea that people 
have naive theories that correspond to the common sense theory 
(as Mayer — 2001, 2005 — hypothesized) seems to be refuted. On 
the contrary, it appears as if this sub-sample comes very close to 
the core concept of the multimedia theory, in which the role of 
individual beliefs is important, even more since they are quite 
adequate with respect to multimedia principles. Moreover, the 
study helps to understand how much of Mayer's multimedia the- 
ory is spontaneously recognized by non-experts and by people 
employed in the didactic field. 
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NAIVE CONCEPTIONS ABOUT MULTIMEDIA LEARNING AND MAYER'S 
COMMON SENSE THEORY 

The naive conceptions of psychological mechanisms involved 
in multimedia learning, which were investigated in the present 
study, allow us to point to a fairly high correspondence between 
people's views and Mayer's model. This is especially true for those 
categories (O and E) that seem to have the greatest influence on 
cognitive processes. Decorative and Representational pictures, on 
the other hand, are more often ascribed to different and naive 
categories. Concerning the attribution of function to different 
images, data leads us to infer that the pictures that, according 
to Mayer, involve more complex cognitive processes (such as O 
and E pictures), were recognized with greater ease by the partic- 
ipants. However, the opposite was true for the pictures that, in 
Mayer's classification, do not have any cognitive function: sub- 
jects tended to ascribe different and "personal" functions to these 
decorative pictures. By examining the participants' utility scores, 
one can see that Dl pictures were considered to be less useful, 
while the E pictures appeared to be perceived as being the most 
useful. The most and least useful pictures endorsed by intervie- 
wees were always coherent with Mayer's functional classification: 
E pictures were regarded as the most useful, and Decorative as 
the least useful. Examination of the Perfect Books built by partic- 
ipants revealed sound consistency with Mayer's principle of spe- 
cial contiguity: this correspondence was maintained in 100% of 
the cases. 

The match between people's conception and Mayer's experi- 
mentally supported theory may be attributed to several factors, 
not least because Mayer often focused his research on scien- 
tific disciplinary fields or on differences between the American 
and Italian educational systems and cultures. Moreover, historical 
factors cannot be disregarded; the studies, allegedly support- 
ing Mayer's pessimistic view about the common sense theory 
(Mayer, 2001), were performed more a decade apart, and a more 
widespread diffusion of multimedia tools in everyday life could 
have provided people with the opportunity to gain a broader and 
more sophisticated life experience, and, consequently, a better 
understanding of multimedia principles. 

IMPLICATIONS 

On a practical level, this study appears to be useful for sev- 
eral reasons. First of all, it is important to consider the naive 
theories of graphic artists who are active participants in the real- 
ization of textbook illustration. If their conceptions are "wrong" 
(that is, not coherent with cognitive theories, and hence refer- 
ring to misleading cognitive principles), the illustrators them- 
selves will contribute to the creation of cognitively dysfunctional 
textbooks. 

Teachers' conceptions are equally important, as it is teachers 
who actually choose the textbooks and decide how to use them 
in classroom. Thus, an adequate conception is important to guar- 
anteeing a good use of pictures as a cognitive aid (e.g., being able 
to point out to students which pictures it can be useful to focus 
on, which pictures are to be studied, which are to be explained in 
relation to the text, and so on). 

Finally, students, if they possess relevant systems of beliefs, can 
be conceived as competent self-regulated learners, who are able 



to decide by themselves which illustrations they must pay atten- 
tion to according to the goals they should reach and to identify 
the mental work which is associated with each kind of image. 
Furthermore, the possession of an adequate conception should 
enable learners to devise efficient multimedia presentations, a 
task which is increasingly assigned to students in contemporary 
instructional settings. 

This study provided suggestions to involve instructional tool 
designers, teachers and students in reflections about the ade- 
quacy of the multimedia tools which they manage. Firstly, a 
tested framework to distinguish among different kinds of illus- 
trations is provided. Secondly, the procedure (as well as the kind 
of tasks and questions) used in this investigation can be applied 
in school settings to highlight the implicit ideas teachers and 
learners have about the alleged effects of multimedia devices 
and to verify if they match the cognitive principles supported by 
empirical research. Thirdly, the naive conception which emerged 
in this study can be assumed as a reference point to question 
the conception that teachers and students share and to hint at 
discussing it. 

Another interesting implication can be derived by the addi- 
tional categories added to the original taxonomy, and derived 
from participants' answers. Many of these categories, as discussed 
above, are closely related to everyday learning practice, and clearly 
derived in a cognitively sensible way from participants' expertise. 
Insights from these new categories should be operationalized both 
by multimedia designers and users' of multimedia products. For 
example, the first "new" categories — contextualizing and moti- 
vational pictures — takes into account the socio-emotional side of 
the learning process, disregarded by traditional cognitive theories. 
In everyday learning, though, this attention could be extremely 
useful to promote effective learning. The other two categories 
named by our sample, operational and mnemonic, are more on 
the cognitive side, but should be used as useful hints by designers, 
because they stress two important steps of the learning process, 
especially in the early literacy steps: operationalize complex pro- 
cedures/concepts and memorize information. Being able to direct 
effectively the use of multimedia elements to would be extremely 
useful. 

LIMITS AND FURTHER STEPS 

We have to take into account the limitations of this study. It 
was a preliminary attempt to analyse naive conception associ- 
ated with the use of multimedia tools for instructional aims. 
Since no previous investigation focused on beliefs about the cog- 
nitive aspects of text-picture presentation has been carried out, 
we had no specific theoretical or methodological model to be 
taken as a frame of reference. We followed a general phaenomeno- 
graphic approach (Marton, 1981) aimed at inducing respondents 
to express the sense they attribute to that piece of experience 
which concerns multimedia learning and the deep convictions 
they possess about it. On the methodological side, we devised 
an original procedure to address this issue. We tried to com- 
bine different procedures, ranging from verbal to concrete tasks, 
as well as from open to closed questions (in the form of both 
dichotomous selections and Likert scales). We hoped to facili- 
tate the interviewees in expressing their conceptions by means 
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of various, complementary ways of answering, so compensating 
for the limits of each procedure. On the other hand, however, we 
made a precise methodological choice in deciding to reconstruct 
the participants' view by making reference to specific instances 
of text-picture presentation, but not to ask questions about mul- 
timedia tools in general, since we thought that the former, but 
not the latter procedure would enable us to go beyond super- 
ficial opinions and to test deeply the respondents' convictions. 
Obviously this makes the investigation rather contextualized, so 
raising problems with the generalization of the findings. The fact 
that the most widely adopted textbooks were selected and that a 
large set of text-picture combinations was employed in the study 
should assure that the material considered here was representative 
of the instructional tools usually employed in school settings, so 
mitigating the limitation in question. However, the issue of con- 
textualization can be meant also in another sense. Metacognitive 
knowledge in itself is contextualized ( Annevirta and Vauras, 2006; 
Anderson et al., 2009) in the sense that beliefs about the opti- 
mal way to combine texts and pictures may vary according to the 
specific learners' goals, skills, and habits, as well as according to 
teachers' and tasks' requests. 



This study provided a first overview of the naive conceptions 
about text-picture combinations. Further studies might investi- 
gate this topic by testing whether these conceptions are influenced 
by the type of experience people have of multimedia tools and by 
the levels of previous knowledge and ability, as well as by cog- 
nitive styles. Another interesting evolution of the present study, 
from a methodological point of view, could be to vary the goals 
given to the participants while rating the pictures, asking individ- 
uals randomly to rate them imagining a use linked "to study" vs. 
"to design good text" and see whether differences emerge between 
the designers and the learners. An interesting way to deepen this 
issue even further in future research could be by exploring not 
only beliefs, but also how beliefs are related to use of learn- 
ing materials, using a mixed method design. Another natural 
development of the present study would be of applying a simi- 
lar methodology to multimedia instructional material presented 
using different kind of devices: would individuals' conception 
change if they are requested to evaluate an illustration presented 
as printed illustration in a textbook, or if the same picture is pre- 
sented within a multimedia didactic tool, a blog, a social media or 
within an instructional application for a tablet? 
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